Computing Solutions in Infinite-Horizon Discounted Adversarial Patrolling Games

Authors

Yevgeniy Vorobeychik

Bo An

Milind Tambe

Satinder P. Singh

Abstract

Stackelberg games form the core of a number of tools deployed for computing optimal patrolling strategies in adversarial domains, such as the US Federal Air Marshal Service and the US Coast Guard. In traditional Stackelberg security game models, the attacker knows only the probability that each target is covered by the defender and is oblivious to the detailed timing of the coverage schedule. In many real-world situations, however, the attacker can observe the defender's current location and can exploit this knowledge to reason about the defender's future moves. We show that this general modeling framework can be captured using adversarial patrolling games (APGs), in which the defender moves sequentially between targets, with moves constrained by a graph, while the attacker can observe the defender's current location and his (stochastic) policy concerning future moves. We offer a very general model of infinite-horizon discounted adversarial patrolling games. Our first contribution is to show that defender policies that condition only on the previous defense move (i.e., Markov stationary policies) can be arbitrarily suboptimal for general APGs. We then offer a mixed-integer nonlinear programming (MINLP) formulation for computing optimal randomized defender policies that can condition on histories of bounded but arbitrary length, as well as a mixed-integer linear programming (MILP) formulation that approximates these with provable quality guarantees. Additionally, we present a nonlinear programming (NLP) formulation for solving zero-sum APGs. We show experimentally that the MILP formulation significantly outperforms the MINLP formulation and is, in turn, significantly outperformed by the NLP formulation specialized to zero-sum games.
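To make the attacker's informational advantage concrete, the following is a minimal sketch, not any of the MINLP, MILP, or NLP formulations from the paper. It evaluates the attacker's best response to a fixed Markov stationary defender policy by value iteration. The payoff model is an assumption made for illustration only: an attack takes one time step, it succeeds if the defender does not move to the attacked target on that step, a caught attacker receives zero, and the names `attacker_values`, `pi`, `target_value`, and `gamma` are hypothetical.

```python
def attacker_values(pi, target_value, gamma, tol=1e-10, max_iter=10_000):
    """Attacker's best-response value for each observed defender location.

    pi[i][j]        -- probability the defender moves from target i to target j
                       (assumed zero wherever the graph has no edge i -> j)
    target_value[j] -- attacker's payoff for a successful attack on target j (assumed)
    gamma           -- discount factor in (0, 1)
    """
    n = len(pi)
    v = [0.0] * n
    for _ in range(max_iter):
        new_v = []
        for i in range(n):
            # Option 1: wait one step and observe the defender's next location.
            wait = gamma * sum(pi[i][j] * v[j] for j in range(n))
            # Option 2: attack target j now; the attack succeeds only if the
            # defender does not move to j on this step (illustrative assumption).
            attack = max(
                gamma * (1.0 - pi[i][j]) * target_value[j] for j in range(n)
            )
            new_v.append(max(wait, attack))
        # Stop once successive iterates agree to within the tolerance.
        if max(abs(a - b) for a, b in zip(new_v, v)) < tol:
            return new_v
        v = new_v
    return v


# Two targets; the defender deterministically alternates between them.
alternating = [[0.0, 1.0], [1.0, 0.0]]
# Two targets; the defender picks the next target uniformly at random.
uniform = [[0.5, 0.5], [0.5, 0.5]]

values_alternating = attacker_values(alternating, [1.0, 2.0], gamma=0.9)
values_uniform = attacker_values(uniform, [1.0, 2.0], gamma=0.9)
```

Under these assumptions, the alternating patrol lets the attacker wait until the defender is about to leave the more valuable target and then strike with certainty, whereas the uniform patrol leaves no location from which waiting helps. This illustrates only why observing the defender's current location matters; it does not reproduce the paper's result on the suboptimality of Markov stationary policies.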

Similar resources

Security Games with Interval Uncertainty

Security games provide a framework for allocating limited security resources in adversarial domains, and are currently used in applications including security at the LAX airport, scheduling for the Federal Air Marshals, and patrolling strategies for the U.S. Coast Guard. One of the major challenges in security games is finding solutions that are robust to uncertainty about the game model. Bayes...

Adversarial Patrolling Games

Defender-Attacker Stackelberg games are the foundations of tools deployed for computing optimal patrolling strategies in adversarial domains such as the United States Federal Air Marshal Service and the United States Coast Guard, among others. In Stackelberg game models of these systems the attacker knows only the probability that each target is covered by the defender, but is oblivious to the...

Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games

We consider a non-cooperative game on an infinite time horizon, with linear dynamics and exponentially discounted quadratic costs. Assuming that the state space is one-dimensional, we prove that the Nash equilibrium solution in feedback form is stable under nonlinear perturbations. The analysis shows that, in a generic setting, the linear-quadratic game can have either one or infinitely many feedba...

Infinite horizon differential games for abstract evolution equations

Berkovitz's notion of strategy and payoff for differential games is extended to study two-player zero-sum infinite-dimensional differential games on the infinite horizon with discounted payoff. After proving dynamic programming inequalities in this framework, we establish the existence and characterization of value. We also construct a saddle point for the game. Mathematical subject classificat...
